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FLP-MEDIATED GENE MODIFICATION IN MAMMALIAN CELLS 
AND COMPOSITIONS AND CELLS USEFUL THEREFOR ' 

This invention relates to recombinant DNA 
technology, in a particular aspect, this invention 
relates to methods for the site-specific recombination of 
DNA in mammalian cells or host mammalian organisms, in 
5 another aspect, the present invention relates to novel 
DNA constructs, as well as compositions, cells and host 
organisms containing such constructs. In yet another 
aspect, the present invention relates to methods for the 
activation and/or deactivation of expression of 

10 functional genes. In a further aspect, the present 

invention relates to methods for the introduction of DNA 
into specific sites in the genome of mammalian cells. In 
a still further aspect, the present invention relates to 
gene therapy methods. In still another aspect, the 

15 present invention relates to means for the recovery of 

transfected DNA from a cell or host organism. In a still 
further aspect, the present invention relates to assay 
methods. 

20 

BACKGROUND OF THE INVENTION 

Many recent manipulations of gene 
expression involve the introduction of transfected genes 

25 (transgenes) to confer sotoe novel property upon, or to 
alter some intrinsic property of, mammalian cells or 
organisms. The efficacy of such manipulations is often 
impaired by such problems as the inability to control the 
chromosomal site of transgene integration; or the 

30 inability to control the number of copies of a transgene 
that integrate at the desired chromosomal site; or by 
difficulties in controlling the level, temporal 
characteristics, or tissue distribution of transgene 
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expression; or by the difficulty of modifying the 
structure of transgenes once they are integrated into 
mammalian chromosomes. 

Transgenes are often introduced into 
5 mammalian cells or organisms to determine which 

components of a transgene are required for specific 
qualitative or quantitative alterations of the host 
system. Since both chromosomal position and copy number 
are major determinants of transgene function, the 
10 usefulness of these analyses is limited because current 
techniques for efficiently introducing transgenes into 
mammalian hosts result in the insertion of a variable 
number of transgene copies at random chromosomal 
positions. Xt is, therefore, difficult (if not 
15 impossible) to compare the effects of one transgene to 
those of another if the two transgenes occupy different 
chromosomal positions and are present in the genome at 
different copy numbers. Considerably more refined 
analyses would be possible if one could routinely 
20 introduce single copies of a variety of transgenes into a 
defined chromosomal position. 

The spatial or temporal characteristics of 
transgene expression is difficult to control in intact 
organisms. The restricted expression of transgenes is 
25 potentially of great interest, as this technique can be 
employed for a variety of therapeutic applications, e.g., 
for the selective interruption of a defective gene, for 
the alteration of expression of a gene which is otherwise 
over-expressed or under-expressed, for the selective 
30 introduction of a gene whose product is desirable in the 
liost, for the selective removal or disruption of a gene 
whose expression is no longer desired in the host, and 
the lite. 

Transgene expression is typically governed 
35 by a single set of control sequences, including promoters 
and enhancers which are physically linked to the 
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transgenes (i.e., cis-acting sequences) . Considerably 
greater expression control could be achieved if transgene 
expression could be placed under the binary control of 
these cis-acting sequences, plus an additional set of 
5 sequences that were not physically linked to the 

transgenes (i.e., trans-acting sequences). A further 
advantage would be realized if the transient activity of 
these trans-acting functions resulted in a stable 
alteration in transgene expression, in this manner, it 
10 would be possible, for example, to introduce into a host 
a transgene whose expression would have lethal or 
deleterious effects if it was constitutively expressed in 
all cells. This would be accomplished by delaying the 
expression of the transgene to a specific time or 
15 developmental stage of interest, or by restricting the 
expression of the transgene to a specific subset of the 
cell population. 

It is currently difficult (if not 
impossible) tp precisely modify the structure pf 
20 transgenes once they have been introduced into mammalian 
cells, in many applications of transgene technology, it 
would be desirable to introduce the transgene in one 
form, and to then be able to modify the transgene in a 
defined manner. By this means, transgenes could be 
25 activated or inactivated or the sequences which control 
transgene expression could be altered by either removing 
sequences present in- the original transgene or by 
inserting additional sequences into the transgene. 

Previous descriptions of recombinase- 
30 mediated rearrangement of chromosomal sequences in 
Drosophila and mammalian cells have not directly 
addressed the question of whether site-specific 
recombinases could routinely create a functional 
translational reading frame. Moreover, the reported: 
35 efficiency of the prior art recombinase system, in the 
only other description of site-specific recombination in 
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mammalian cells reported to date [based on Cre 
recombihase, described by Saner and Henderson in Nucleic 
Acids Research, Vol, 17: 147 (1989)] appears to be quite 
low. 

5 

BRIEF DESCRIPTION OP THE INVENTION 

In accordance with the present invention, 
10 we have developed a system for the selective modification 
of chromosomal or extrachromosomal DNA in mammalian 
cells. Selective modification can involve the insertion 
of one DNA into another DNA (e.g. , to create a hybrid 
gene, to activate a gene, to inactivate a gene, and the 
15 like), or the removal of specific DNA molecule (s) from 
other DNA molecule (s) containing the DNA to be removed 
(e.g., to inactivate a gene, to bring desired DNA 
fragments into association with one another, and the 
like) . 

20 The recombination system of the present 

invention is based on site-specific recombinase, FLP. In 
one application of the invention recombination system, 
FLP-mediated removal of intervening sequences is required 
for the formation of a functional gene. Expression of 

25 the functional gene therefore, falls under the control of 
both the regulatory sequences associated with the 
functional gene and also under the control of those 
sequences which direct FLP expression. 

The reverse of the above-described 

30 process, i.e., the FLP-mediated introduction of DNA, 
provides a convenient and selective means to introduce 
DNA into specific sites in mammalian chromosomes. 

FLP-mediated recombination of marker genes 
provides a means to follow the fate of various sequences 

35 over the course of development and/or from generation-to- 
generation. The recombination event creates a functional 
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marker gene. This gain-of -function system can be used 
for lineage analyses in a vide variety of tissues in 
different organisms. Prior to FLP-mediated 
recombination, the marker gene is normally silent, i.e., 
5 the phenotype typical of the marker is not observed. In 
the absence of FLP, spontaneous recombination to produce 
functional marker occurs only at very low frequencies. 
In the presence of FLP, functional marker is efficiently 
produced. In addition, this gain-of- function system is 

10 heritable and is easily detected by simple histochemical 
assays. For example, in transgenic mice, the lineages in 
which recombination is to occur can be controlled by 
appropriate selection of the promoters used to drive FLP 
expression. This could include promoters that are only 

15 transiently active at a developmental stage that 

substantially precedes overt cell differentiation. Since 
transcription of the marker gene is controlled by 
regulatory sequences associated therewith, functional 
marker genes can be expressed at later developmental 

20 stages, after cell differentiation has occurred. By this 
means, it is possible to construct a map for mammalian 
development that correlates embryonic patterns of gene 
expression with the organization of mature tissues. 

25 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 presents schematic diagrams of 
FLP-mediated recombination events* In Figure 1A, FLP- 
30 mediated introduction of DMA is illustrated, while in 

Figure IB, FLP-mediated removal of intervening sequences 
is illustrated. 

Figure 2 is presented in three parts. 
Figure 2A presents schematic diagrams of the expression 
35 vectors pFRT/JGAL, pNEO0GAL, and pOG44 FLP* Figure 2B 
presents a Southern blot of Hirt lysates prepared from 
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293 (human embryonic kidney) cells transfected with one 
microgram of pN£0£GAL and varying amounts of the pOG44 
FLP expression vector. Figure 2C graphically presents 
the 0-galactosidase activities in the same transf ections 
5 shown in part B, referred to above. 

Figure 3 A, at the top, presents a 
schematic of the pattern of plasmid integration in £25 
deduced from Southern blot analysis. Figure 3A, in the 
middle, presents the predicted pattern for 

10 £-galactosidase positive subclones of £25 if precise 

recombination across the FLP-recombination target sites 
occurs. Figure 3A, at the bottom, presents the predicted 
pattern for /J-galactosidase negative, neomycin resistant 
subclones of E25B2 after FLP mediated insertion of pOG45 . 

15 Figure 3B presents an analysis of genomic DMA from a cell 
line with a single integrated copy of pNBO^GAL (i.e., 
CVNEO0GAL/E25, designated as £25) , two derivative 
0-galactosidase-positive subclones (designated as E25B1 
and E25B2) , and two subcloneis derived from E25B2 after 

20 transf ection with p0645 (designated as B2N1 and B2N2) . 

DETAILED DESCRIPTION OF THE INVENTION 

25 In accordance with the present invention, 

there is provided a mammalian recombination system 
comprising: 

(i) FLP recombinase, or a nucleotide sequence 
encoding same, and 
30 (ii) a first DNA comprising a nucleotide 

sequence containing at least one FLP 
recombination target site. 
In accordance with another embodiment of 
the present invention, there are provided novel DNA 
35 constructs useful for the introduction of DNA into the 
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genoma of a trans fected organism, said DNA construct 
comprising, as an autonomous fragment: 

(a) at least one FLP recombination target 
i site, 

(b) at least one restriction endonuclease 
recognition site, 

(c) at least one marker gene, 

(d) a bacterial origin of replication, and 
optionally 

(e) a mammalian cellular or viral origin of 
DNA replication* 

In accordance with yet another embodiment 
of the present invention, there are provided novel DNA 
constructs useful for the rescue of DNA from the genome 
15 of a transf ected organism, said DNA construct comprising, 
as an autonomous fragment, in the following order, 
reading from 5» to 3 1 along said fragment: 

(a) a first FLP recombination target site, 

(b) . an insert portion comprising, in any 
20 suitable sequence: 

(1) at least one restriction endonuclease 
recognition site, 

(2) at least one marker gene, 

( 3 ) a bacterial origin of replication , 
25 and optionally 

(4) a mammalian cellular or viral origin 
of DNA replication, and 

(c) a second FLP recombination target site in 
tandem with said first FLP recombination 

30 target site. L 

In addition, there are provided methods for the recovery 
of transf ected DNA from the genome of a transf ected 
. organism employing the above-described constructs * 
In accordance with still another 
35 embodiment of the present invention, there is provided a 
method for the assembly of a functional gene (which is 
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then suitable for activation of expression) , in mammalian 
cells, by recombination of individually inactive gene 
segments derived from one or more gene(s) of interest, 
wherein each of said segments contains at least one 
5 recombina t ion target site, said method comprising: 

contacting said individually inactive gene 
segments with a FLP recombinase, under 
conditions suitable for recombination to occur, 
thereby providing a DNA sequence which encodes 
10 a functional gene of interest. 

In accordance with a further embodiment of 
the present invention, there is provided a method for the 
disruption of functional gene(s) of interest, thereby 
inactivating expression of such gene(s) , in mammalian 
15 cells, wherein said gene(s) of interest contain at least 
one FLP recombination target site, said method coaqprising 
contacting said gene(s) Of interest with: 

(i) a DNA segment which contains at least one 
. - FLP recombination target site, and 
20 (11) FLP recombinase; 

wherein said contacting is carried out under conditions 
suitable for recombination to occur between said gene and 
said DNA segment, thereby disrupting the gene(s) of 
interest and rendering said gene(s) non-functional, 
25 in accordance with a still further 

embodiment of the present invention, there is provided a 
method for the precisely targeted integration of DNA into 
the genome of a host organism, said method comprising: 
(i) introducing a FLP recombination target 
30 site into the genome of cells which are 

compatible with the cells of the subject, 
(11) introducing a first DNA comprising a 

nucleotide sequence containing at least 
one FLP recombination target site therein 
35 into the FLP recombination target site in 

the genome of said cells by contacting 
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said cells with said first DNA and FLP 
recombinase, and thereafter 
(iii) introducing the cells produced by the 
process of step (ii) into said subject, 
5 wherein the restating cells and/or 

organism have the optional ability to 
have DNA reproducibly and repetitively 
inserted into and/or recovered from the 
host cells and/or organism. 
10 In accordance with another aspect of the 

present invention , there are provided mammalian cells, 
wherein the genomic DNA of said cells contain at least 
one FLP recombination target site therein. 

In accordance with yet another aspect of 
15 the present .invention, there are provided transgenic, 

non-human mammals, wherein said mammals contain at least 
one FLP recombination target site in the genomic DNA 
thereof. 

In accordance with yet another aspect of 
20 the present invention, there is provided a method for the 
site-specific integration of trans feet ed DNA into the 
genome of the above-described cells and/or transgenic, 
non-human mammals, said method comprising: 

(i) contacting said genome with: 
25 (a) FLP recombinase, and 

(b) a first DNA comprising a 

nucleotide sequence containing 
at least one FLP recombination 
target site therein; and 
30 thereafter 

(ii) maintaining the product of Step (i) 
under conditions suitable for site- 
specific integration of said DNA 
sequence to occur at the FLP 

35 recombinat ion target site in said 

- genome.. 
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In accordance with a further aspect of the 
present invention, there is provided a method for the 
analysis of the development of a mammal, said method 
comprising: 

5 (a) providing a transgenic mammal comprising: 

(i) an expression construct encoding FLP 
under the co ntr ol of a conditional 
promoter, and 
(il) a reporter construct under the 
10 control of the same or a different 

promoter, wherein said reporter 
construct encodes a functional or 
non-functional reporter gene product, 
and wherein said construct contains 
15 at least one FLP recombination target 

site therein, 

wherein the functional 
expression of the functional reporter 
gene is disrupted when said FLP 
20 recombination event occurs, or 

wherein the functional 
expression of the non-functional 
reporter gene commences when said FLP 
recombination event occurs; and 
25 (b) following the development of said mamm al to 

determine when expression of functional reporter gene 
product either commences or is disrupted. 

In accordance with a still further aspect 
of the present invention, there is provided a co- 
30 transfection assay FLP-mediated recombination, said assay 
comprising; 

(a) co-transf ecting a host mammalian cell with: 

(i) a FLP expression plasmid, and 

(ii) a reporter plasmid comprising a 

35 reporter gene inactivated by the presence 
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of at least one recombination target site; 
and 

(b) monitoring said host cell under a variety 
of conditions for the gain of expression of functional 
5 reporter gene product. 

FLP recombinase is a protein which 
catalyzes a site-specific recombination reaction that is 
involved in amplifying the copy number of the 2j* plasmid 
of s.cerevisiae during DNA replication. FLP protein has 

10 been cloned and expressed in E.coli [see, for example, 
Cox, in proceedings of the National Academy of Sciences 
U.S.A., Vol. 80s 4223-4227 (1983)], and has been purified 
to near homogeneity [see, for example, Meyer-Iiean, et 
al., in Nucleic Acids Research, Vol. 15: 6469-6488 

15 (1987)]. FLP recombinases contemplated for use in the 
practice of the present invention are derived from 
species of the genus Saccharbmyces . Preferred 
recombinases employed in the practice of the present 
invention are derived from strains of Saccharomyces 

20 cerevisiae. Especially preferred recombinases employed 
in the practice of the present invention are proteins 
having substantially the same amino acid sequence as set 
forth in Sequence. I.D. No. 2, as encoded, for example, by 
Sequence I.D. No. 1, or the sequence set forth by Hartley 

25 and Donelson, Nature 2J&i 860 (1980) * 

The FLP recombination target site 
(sometimes referred to herein as "FRT") has also been 
identified as minimally comprising two 13 base-pair 
repeats, separated by an 8 base-pair spacer, as follows: 

30 

-Spacer- 

5 1 — GAAGTTCCTATTC f TCTAGAA A 1 GTAIAGGAACTTC— 3 1 
Xbal 
site 

-35. 

The nucleotides in the above "spacer" region can be 
replaced with any other combination of nucleotides, so 
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long as the two 13 base-pair repeats are separated by 8 
nucleotides . The actual nucleotide sequence of the 
spacer is not critical, although those of skill in the 
art recognize that, for some applications, it is 
5 desirable for the spacer to be asymmetric, while for 

other applications, a symmetrical spacer can be employed. 
Generally, the spacers of the FLP recombination target 
sites undergoing recombination with one another will be 
the same. 

10 As schematically illustrated in Figure 1A, 

contact of genomic DNA containing a FLP recombination 
target site (shown as the linear Psv— BEEA-GAL construct) 
with a vector containing a FLP recombination target site, 
in the presence of the protein, FLP recombinase, results 

15 in .recombination that forms a new genomic sequence 
wherein the vector sequences have been precisely 
incorporated into the genome of the host. The reverse of 
this process is shown schematically in Figure IB, wherein 
a genomic sequence or construct containing two tandemly 

20 oriented FLP recombination target sites, upon contacting 
with FLP, is recombined and the FLP recombination target 
site-bounded fragment is excised as a circular molecule. 

Genes of interest contemplated for use in 
the practice of the present invention can be selected 

25 from genes which provide a readily analyzable functional 
feature to the host cell and/or organism, e.g., visible 
markers (such as /3-galactosidase, thymidine kinase, 
tyrosinase, and the like) , selectable markers, (such as 
markers useful for positive and negative selection, e.g., 

30 genes for antibiotic resistance) , as well as other 
functions which alter the phenotype of the recipient 
cells, and the like. 

The first DNA employed in the practice of - 
the present invention can comprise any . nucleotide 

35 sequence containing at least one FLP recombination target 
site, which will precisely define the locus at which FLP- 
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mediated recombination will occur. The nucleotide 
sequence can comprise all or part of a gene of interest, 
as veil as other sequences not necessarily associated 
with any known gene. Optionally, for ease of later 
5 recovery of the gene of interest (in "activated 1 ' or 
modified form), the first DMA can optionally contain a 
second FLP recombination target site. 

The second DNA employed in the practice of 
the present invention is selected from at least a second 

10 portion of the first gene of interest or at least a 

portion of a second gene of interest (including an intact 
form of a second gene of interest) * When the second DNA . 
is at least a second portion of the first gene of 
interest, the site-specific recombination of the present 

15 invention may act to provide a functional combination of 
the different portions of the first gene of interest. 
Alternatively, when the second DNA is at least a portion 
of a second gene of interest, the site-specific 
recombination of the present invention may act to provide 

20 a functional hybrid gene, which produces a product which 
is not identical with either the product of the first 
gene or the second gene. As yet another alternative, 
when the second DNA is a portion of a second gene, the 
site-specific recombination of the present invention may 

25 act to disrupt the function of the first gene of 

interest. Based on the nature of the first DNA and the 
second DNA, as well as the mode' of interaction between 
the two, the site-specific interaction of the present 
invention may create or disrupt a feature which is 

30 colorimetrlcally. detectable, immunologically detectable, 
genetically detectable, and the like. 

In accordance with the present invention, 
assembly of a functional expression unit is achieved in 
any of a variety of ways, e.g., by association of the 

35 gene Qf interest with a functional promoter, by assembly 
of common gene fragments to produce a complete functional 
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gene (which, in combination with its promoter, comprises \ 
a functional expression unit) or assembly of diverse 
gene fragments from diverse sources to produce a 
functional, hybrid gene (which, in combination with a 
5 promoter, comprises a functional expression unit), and 
the like. Upon assembly of a functional expression unit 
as described herein, expression of the functional gene to 
produce a protein product can be activated in the usual 
manner. In the absence of FLP-mediated recombination, 

10 activation of expression, would fail to produce a 
functional protein product. 

In accordance with the present invention, 
dis-assembly of a functional expression unit is achieved 
in any of a variety of ways, e.g. , by dis-associating the 

15 gene of interest from a functional promoter, by dis- 
assembly (e.g., disruption) of the functional gene (e.g., 
by introduction of DNA which renders the entire sequence 
non-functional), by removal of a substantial portion of 
the coding region of said gene, and the like. Upon dis- 

20 assembly of a functional expression unit as described 
herein, expression of the functional gene product under 
the conditions required prior to gene dis-assembly is no 
longer possible. The ability of the expression unit to 
be activated for expression has therefore been disrupted. 

25 The gene in this situation can be said to be inactivated, 
since activation of expression is not possible. 

Individually inactive gene segments 
contemplated for use in the practice of the present 
invention are fragments which, alone, do not encode 

30 functional products. Such fragments can be derived from 
a first gene of interest alone, or from both a first and 
second gene of interest DNA fragments. 

When gene inactivation is desired, the 
gene of interest can be disrupted with a DNA fragment 

35 which .throws the gene of interest out of reading frame 

(e.g., an insert wherein the number of nucleotides is not 
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a multiple of 3), Alternatively, the gene of interest 
can be disrupted with a fragment which encodes a segment 
which is substantially dissimilar with the gene of 
interest so as to render the resulting product non- 
5 functional. As yet another alternative, the gene of 

interest can be disrupted so as to dis-associate the gene 
of interest from the transcriptional control of the 
promoter with which it is normally associated* 

The introduction of DNA, e.g., DNA 
10 encoding FLP recombination target sites, into the genome 
of target cells can be accomplished employing standard 
techniques, e.g., transfection, microinjection, 
electroporation, infection with retroviral vectors, and 
the like. 

15 Introduction of protein, e.g., FLP 

recombinase protein, to host cells and/or organisms can 
be accomplished by standard techniques, such as for 
example, injection or microinjection, transfection with 
nucleotide sequences encoding FLP, and the lilce. 
20 When employed for gene therapy of an 

intact organism, introduction of transgenic cells into 
the subject is accomplished by standard techniques, such 
as for example, grafting, implantation, and the like. 

Mamma lian cells contemplated for use in 
25 the practice of the present invention include all members 
of the order Mammalia, such as, for example, human cells, 
mouse cells, rat ceils, monkey cells, hamster cells, and 
the like. 

Host organisms contemplated for use in the 
30 practice of the present invention include each of the 
organism types mentioned above, with the proviso, 
however, that no claim is made to genetically modified 
human hosts (although the present invention contemplates/ 
methods for the treatment of humans) . 
35 Once FLP recombinase (or DNA encoding 

same) and DNA containing at least one FLP recombination 
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target site have been introduced into suitable host 
cells /organisms , the cells/host organisms are maintained 
under conditions suitable for the site-specific 
recombination of DMA. Such conditions generally involve 
5 conditions required for the viability of the host cell or 
organism. For in vitro manipulations, conditions 
employed typically involve low concentrations of a 
variety of buffers having a pH of between about 5-9 and 
ionic strengths in the range of about 50-350 mM. See, 

10 for example, Senecoff, et al., in JgSEDal of WQl^cBlag- 
Bioloov. Vol. 221* 405-421 (1988). 

In accordance with a particular aspect of 
the present invention, a co-trans f ection assay has been 
developed which can be used to characterize. FLP-mediated 

15 recombination of extrachromosomal DNA in a variety of 

cell lines. Cells are co-transfected with an expression . 
construct and a "reporter" plasmid that is a substrate 
for the recombinase . The expression construct encodes a 
FLP recombinase protein. The reporter plasmid encodes 

20 either a functional reporter gene containing at least one 
recombination target site therein, or a non-functional 
reporter gene containing at least one recombination 
target site therein. Upon expression of FLP by the. 
expression construct, the functional reporter gene will 

25 be rendered non-functional, or the non- functional 

reporter gene will be rendered functional. Thus, the 
activity of the expression construct can be assayed 
either by recovering the reporter plasmid and looking for 
evidence of recombination at the DNA level, or by 

30 preparing cytoplasmic extracts and looking for evidence 
of recombination at the protein level (i.e. , by 
measuring the expression of reporter gene activity 
generated by the recombined reporter) . Such assays are 
described in greater, detail in Example 1 below. 
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The invention will now be described in 
greater detail by reference to the following non-limiting 
examples. 

5 EXAMPLES 

Example 1 
Co-transf ection Assays 

10 The co-transf ection assay used to 

characterize FLP-mediated recombination of 
extrachromosomal DNA involved trans feet ion of cells with 
an expression construct and a "reporter" plasmid that was 
a substrate for the recombinase. The activity of the 

15 expression construct could be assayed either by 

recovering the reporter plasmid and looking for molecular 
evidence of recombination at the DMA level, or by 
preparing cytoplasmic extracts and looking for evidence 
of recombination at the protein level (i.e., by measuring 

20 l-galactosidase activity generated by recoanbined 
reporter) . 

The pNEO/3GAL reporter plasmid used for 
these assays was derived from pFRT/JGAL (Fig. 2A) . In the 
Figure, half-arrows indicate positions of FLP 

25 recombination target (FRT) sites; B and S designate EcoRI 
and Seal restriction sites, respectively; Psv designates 
early promoter, from SV40; BETA-GAL designates the 0- 
galactosidase structural sequence; NEO designates 
neomycin expression cassette; Pcmv designates the 

30 cytomegalovirus im mediat e early promoter; IN designates 
an intron; FLP designates a FLP coding sequence; AN 
designates an SV40 adenylation cassette; thin lines 
represent vector sequences; and the sizes of restriction 
fragments are indicated in kb. 

35 pFRTpGAL contains a version of the 

bacterial £ -galactosidase sequence modified, by insertion 
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of a FLP recombination target site, or FRT, within the 
protein coding sequence immediately 3 1 to the 
translational start. The oligonucleotide used for the 
construction of pFT/JRGAL was: 

5 

5 « -GATCCCGGG CTACCATGGA • GAAGTTCCTATTC • CGAAGTTCCTATTC 
f TOT AG A) AACTATAGGAACTTCA-3 ■ . 

This oligonucleotide contains an in-frame start codon, 
10 minimal FRT site, and an additional copy of the 13-bp FRT 
repeat [*XXX*]; the Xbal site within the FRT spacer is 
enclosed in parentheses. The linker was inserted between 
the BamHI and Hindlll sites of pSKSlOS (M.J. Casadaban, 
A. Martin-Arias, S.K. Shapira, and J. Chou, Mah. EnzymcL 
15 100, 293 (1983)) and the LacZ portion of modified gene 
was cloned into a pSV2 vector. The neomycin cassette 
used for construction of pNEO£GAL was an Xhol to BamHI 
fragment from pMOneo-polyA (K. Thomas and M. Capecchi, 
Cfeff 51:503 (1987)) cloned between copies of the J3 FRT 
20 site in pUC19. 

The FRT consists of two inverted 13 -base- 
pair (bp) repeats and an 8-bp spacer that together 
comprise the FRT site, plus an additional 13-bp 

repeat which may augment reactivity of the minimal 

25 substrate. The /J-galactosidase translational reading 
frame was preserved upon insertion of the FRT site, and 
the resulting plasmid, pFRT0GAL, generated robust 
activity in mammalian cells (Table 1) - 

pNEO/JGAL was constructed by cutting 

30 pFRT^GAL in the middle of the FRT site with Xbal and then 
inserting an Xbal fragment consisting of two half-FRT 
sites flanking a neomycin transcription unit. This 
created intact FRTs on either side of the neomycin 
cassette and rendered the /J-galactosidase transcription 

35 unit inactive (Table 1) . Precise FLP-mediated 

recombination of the FRTs caused the excission of the 



WOW/15694 PCT/US92/0189* 

-19- 

neomycin cassette, recreated the parental pFRT/JGAL 
plasmid, and restored /?-galactosldase expression. 

Co-transf ection of cells with a fixed- 
amount of pNE0£GAL reporter plasmid and increasing 
5 amounts of the pOG44 FLP expression vector generated 
increasing amounts of reconbined reporter plasmid and 
consequently, increased levels of 0-galactosidase 
activity. Molecular evidence for FLP-mediated 
recombination was obtained by recovering plasmids 36 

10 hours after transf ection, followed by endonuclease 

treatment (with EcoRI and Seal) and Southern blotting 
(see Fig. 2B; employing as a probe the fragment of 
PFRT0GAL indicated at the top of Fig. 2A) . Lysates of 
cells from cotransfections that included the pOG44 FLP 

15 expression vector showed a signal at 5.6 kb, the position 
at which recombined reporter (equivalent to pFRT£GAL) 
would run, and a 3.2 kb signal that was generated by 
unrecombined pNE0£GAL reporter (Fig. 2A) . The 5.6 kb 
band intensity was proportional to the amount of FLP 

20 expression plasmid included in the transf ection. The 5. 6 
kb band was not seen in cotransfections in which a non- 
FLP plasmid was substituted for the FLP expression vector 
(Fig. 2B) or in transfections thai: contained only pOG44 ( 
(and no reporter plasmid) . pOG44 generated additional 

25 signals at 2.2 kb and 2.8 kb because the plasmid used in 
its construction contained EcoRI and EcoRl-Scal fragments 
of such length. 

pOG44 consists of the cytomegalovirus 
immediate early promoter from pC0M8 [see Aruffo and Seed 

30 in Proc. Natl Acad. Sci. , USA a±z 8573 (1987) J, a 5 1 

leader sequence and synthetic intaron from pMLSIScat [see 
Huang and Gorman in Nucl. Acids Res. 18: 937 (1990) ], the 
FLP coding sequence (bp 5568-6318 and 1-626 of the 2/na 
circle, [see Hartley and Donelson, Nature 286 : 860 

35 (1980) ] and the SV40 late regioii polyadenylation signal 
from pMLSIScat. The following silent nucleotide 
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substitutions were introduced into the structural FLP 
sequence using the polymerase chain reaction: C for T at 
position 5791 ,. G for A at 5794, 6 for C at 5800, C for T 
at 55, 6 for A at 58, and C for T at 103. These changes 
5. eliminated three cannohical AATAAA polyadenylation 

signals and introduced a PstI restriction site without 
altering the amino acid sequence encoded by the 
nucleotide sequence. p062 8 consists of a murine cDNA for 
dihydrof olate reductase cloned into pCDM8 (Aruf f o and 
10 Seed, suprfr) . 

In the same samples, 0-galactosidase 
activity vas also proportional to the amount of FLP 
expression plasmid included (Fig. 2C) • Only background 
activities were observed in cotransf ections that included 

15 a non-FLP control plasmid (Table 1) or when pOG44 alone 
was transfected. The experiment thus provides both 
molecular and biochemical evidence for precise FLP— 
mediated recombination in mammalian cells. 

v Table 1 presents 0-galactosidase 

20 activities in cotransf ection assays of 293, CV-1, and F-9 
cells. Positive control transf ections (pFRT£GAL) 
included 1 pg of pFRT0GAL and 18 pq of the pOG28 non-FLP 
control plasmid; negative control transf ections 
(pNE0£GAL) included 1 /ig of pNEO^GAL and 18 jig of the 

25 p0G28; and experimental transf ections (pNEO£GAL + FLP) 
contained 1 fig of pNEO/JGAL and 18 /xg of the pOG44 FLP 
expression plasmid (Fig. 1A). The column headed by 
shows the pK£0/3GAL + FLP values as a percentage of the 
pFRTfiGAL positive control. Each value represents the 

30 mean for six plates from two experiments. Standard 
errors are in parentheses. Neither pOG28 nor pOG44 
generated 0-galactosidase activity when transf ected 
alone. All transf ections contained 1 pg Qt pRSVL [de Wet 
et al., Mol. Cell. Biol. 2: 725 (1987)) to correct p- 

35 galactosidase activities for relative transf ection 
efficiencies. 
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Subcohflueht cultures of cells In 10 cm 
dishes and grown in Dulbecco 9 s modified Eagle's medium 
(DMEM) and 5* calf serum were transfected by overnight 
exposure to calcium phosphate precipitates [Graham et 
5 al.. Virology 15:59 (1979)] and then split 1:4. After 24 
hours incubation, one plate of each trans feet ion was 
harvested by Hirt extraction [J; Mol Biol. 2£:365 (1967)] 
and a second plate was used to prepare cytoplasmic 
extracts [de Wet et al., fifflra]. Approximately 5% of the 

10 DNA recovered from single plates was used for Southern 
analyses, 0-galactosidase assays were performed as 
described by Hall et al., in J. Mol. Appl. Genet. 2:101 
(1983)]. Lucif erase activities generated by the 
inclusion of 1 jig of pRSVL (de Wet et al., supra ) in all 

15 trans feet ions were used to correct /?-galactosidase 

activities for relative transfection efficiencies. The 
experiment was repeated twice with similar results. 

TABLE 1: 0-GALACTOSIDASB ACT IVI T IE S (UNITS/ MG PROTEIN) 
20 IN COTRANSFEdtED CELLS 



25 



35 



40 



CELL LINE TRANSFECTIONS 



pFRT0GAL pNEO£GAL pNE0£GAL 

+ FLP 



30 293 30.4 (1.9) 0.17 (0.02) 14.2 (2.2) 47 

275 (25) 0.33 (0.06) 22.6 (1.2) 8 
F9 24.8 (4.3) 0.04 (0.01) 1.88 (0^02) 8 



FLP activity has also been demonstrated in 
monkey kidney (CV-1) and mouse embryonal carcinoma (F9) 
cells. In Table 1, the 0-galactosidase activity in the 
"pFRT/JGAL" transfections represents an estimate of the 
expression expected if all the pNEO/JGAL in a co- 
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transfection were immediately recombined. The highest p~ 
galactosidase expression in co-transf ections employing 
pNEO/JGAL plus p6G44, relative to pFRT0GAL transfected 
cells, was 47%, seen in 293 cells., This is a remarkable 
5 level considering that ^-galactosidase egression 

required both FIP expression, followed by recombination 
of pNEO£GAL, to produce a construct capable of expressing 
0-galactosldase. Co-transf ections of CY-1 and F9 cells 
generated 8% of the activity seen in the pFBIpG&Ir 
10 transf ections. Even at this lover relative activity, 
cotransfected cells were readily observed in { 
histochemical reactions; for ^-galactosidase activity. 

15 Example 2 

Fig-Mediated Removal of Intervening Sequences 

If the invention method is to be widely 
applicable, for example for gene activation in transgenic 

20 mammals, the ability of PLP to faithfully promote precise 
recombination at FLP recombination target sites contained 
in the mammalian genome is required. Such ability is 
demonstrated in this example. 

Cell lines that contain single integrated 

25 copies of pNB0£GAL (designated CVNBOpGAI/B) were produced 
by transf ecting CV-1 cells with linearized plasmid by 
electroporation, then isolated by selecting G418- 
resistant (G418*) transf ectants that stably expressed the 
neomycin cassette, and finally identifying single copy 

30 lines by Southern blot analyses (Fig. 3>. As previously 
shown for other integrated constructs with similarly 
short direct repeats, the chromosomal FRTs did not 
spontaneously recombine (in the absence of FLP) to 
produce a 0-g^lactosidase-positive (0GAL*) phenotype at 

35 detectable frequencies (Table 2) • 
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Transient expression of FLP in the 
CVNEO0GAI/E lines (by transiently transf ecting with the 
pOG44 FLP expression vector) promoted a rapid conversion 
to a /JGAL* phenotype. When five different lines were 
5 transiently transfected with the pOG44 FLP expression 
vector, 0-galactosidase activities at 36 hours were 40 to 
100-fold higher than those seen in replicate plates 
transfected with a non-FLP plasmld (Table 2) . At 48 
hours after transf ection histochemical processing showed 

10 many positive cells (Table 2). 

Table 2 presents the £-galactosidase 
phenotypes of CVNEO0GAI/E lines, which contain a single 
copy of the 0-galactosidase inactive reporter, pNEO0GAL, 
after transfection with FLP expression (p0644) , non-FLP 

15 negative control (pOG28) or 0-galactosidase positive 

control (pFRT/JGAL) plasmids. The pFRT^GAL transf actions 
included Iftq Of pFRT0GAL and 19/ig of pOG44; other mixes 
contained 20/ig of the indicated plasmid. £-galactosidase 
activities are mean values for triplicate transf ections 

20 performed as described for Fig. 2 and assayed 36 hours 
after removal of precipitates; standard errors for the 
pOG44 transf ections were less than 10% of the mean. The 
percent positive was determined by scoring more them 10 3 
cells after transfection and histochemical processing as 

25 described by de Wet et al., supra . 
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TABLE 2 

0-GALACTOSIDASB PHENOTYPES OF 
TRANSFECTBD CVNEO0GAL CELL LINES 



5 . 

rtgT.T. ACTIVITIES PERCENT POSITIVE 



10 


LINE 


(units/mg protein) 










p0628 


pOG44 


pOG28 


pFRT0GAL 


p0644 




E6 


0.24 


11.2 


. Of 


8.7 


6.1 




E25 


0.21 


16.7 


ot 


17.1 


12.4 


15 


E26 


0.18 


7.2 


Of 


19.5 


15.4 




E14 


0.28 


13.1 


ND 


ND 


ND 




E22 


0.09 


9.6 


ND 


ND 


ND 



fNo positive cells were found among >10 6 cells examined. 
20 ND: Not done. 



To provide some estimate of the efficiency 

25 of recombination, an additional set of replicate plates 
were transf ected with the pERT£GAL 0-galactosidase 
expression vector. Comparing the fractions of cells that 
were 0GAIH- in the pFRT^GAL and in the p0G44 trans feet ions 
(assuming similar transf ection efficiencies) suggests 

30 that most (70-80%) of the cells transacted with p0G44 
were converted to a 0GAL* phenotype (Table 2) • The 
comparison undoubtedly underestimates the efficiency of 
FLP-mediated excision. Whereas many copies of a 
functional 0-galactosidase gene were available for 

35 immediate transcription in the positive controls, 

recombination may have o ccur r e d shortly before harvest in 
some pOG44-transf ected cells. In these cases the single 
recombined reporter gene may not have generated enough 
l-galactosidase by the time of harvest to render the 

40 cells positive in this assay. 
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The 0GAL* phenotype was passed on to all 
descendents of many FLP-converted cells. Positive 
colonies were formed during prolonged expansion of 
individual colonies. Entirely negative colonies and 
5 mixed colonies were also observed. Mixed colonies would 
be expected if recombination occurred after mitosis in 
only one descendent of a transfected cell, or if 
recombined and unrecombined cells mixed at replating or 
during subsequent growth. Indeed, the physical 

10 segregation of phenotypes evident in most mixed colonies 
suggested that they were composed of stably positive and 
negative lineages. 

The correlation between £-galactosidase 
expression and recombination at FRT sites was examined by 

15 comparing the structure of the integrated pNE0£GAL 

sequences in two £GAL* subclones to the parental line. 
CVNEO/3 GAI/E2 5 (106) cells were transfected with the pOG44, 
FLP expression vector and subcloned 12 hours after 
removal of the precipitate. After histochemical 

20 screening, two /?GAlT subclones (E25B1 and E25B2) were 
expanded for further analysis . In Southern blots of 
genomic DNA from both subclones, the pattern of 
hybridization matched that expected for FLP-mediated 
recombination of the FRT sites in the parental line (Pig. 

25 3) . While recombination products have not been recovered 
and sequenced, these Southern analyses and the. fact that 
activation of /?-galactosidase expression required 
creation of a functional translational reading frame 
indicate that FLP-mediated recombination was precise. 



30 



WO 92/15694 



PCT/US92/01899 



-26- 

Example 3 
FIiP Mediated Recombination of FRT 
on an Extrachromosomal Molecule With a 
Chromosomally Integrated FRT 

5 

. Reversal of the process described in the 
previous Example, i.e., the FLP-mediated recombination of 
an FRT site on a plasmid with a chromosomally integrated 
FRT site, can be used to target the Integration of 

10 transfected plasm Ids to specific genomic sites. To 
determine the frequency at which this occurs, G418- 
sensitive, /?GAlT B25B2 cells were co-transfected with the 
p0644 FLP expression vector and a plasmid, pOG45, that 
contained a neomycin resistance gene expression cassette 

15 and a single FRT. pOG4S consisted of the neomycin 

resistance cassette and 3 ' FRT from pNB0j3GAL cloned into 
pUC19. 8 x 10 5 CVNEO0GAL cells were transfected by 
electroporation in 800 Ml of saline containing 40 fig of 
p0644 and 0.1 of either pOG45 or, for a negative 

20 control, pOG45A (which was derived from pOG45 by deleting 
a 200 bp fragment containing the FRT) • 

G418* subclones (designated B2N) from three 
transfections that had stably integrated pOG45 were 
histochemically stained for 0-galactosidase activity and 

25 more than half (104 of 158, or "66%) were either entirely 
0-galactosidase-negative (pGAL) or predominantly pG&L 
with a few clusters of £GAL* cells. The remaining 
colonies were 0GAL*. With continued passage as dispersed 
monolayers, the fraction of 0GAL* cells in the mosaic 

30 lines rapidly diminished. This suggested they were 6418* 
sensitive cells that initially survived because of their 
proximity to resistant cells; this was confirmed by 
reconstitution experiments. All of the 55 colonies 
formed after parallel co-trans f ections of pOG44 and a 

35 derivative of pOG45 (p0645A) that lacked an FRT were 
^GAL*. > 
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Tha correlation between loss of 
p-galactosidase activity and recombination between 
plasmid and chromosomal FRTs was examined in Southern 
analyses. Because the FRT and neomycin cassette of pOG45 
5 were derived from the neomycin cassette and 3 1 FRT of 

pNEO£GAL (Fig. 2A) , recombination of the plasmid FRT with 
the E25B2 chromosomal FRT regenerates the 3.2 kb EcoRI 
fragment of the original CVHBO0GAI/E25 parent. 
Additionally, the 8.5 kb junctional fragment of 

10 CTNBO0GAVB25 shifts to 12.0 Kb because pOG45 is 3.5 kb 
larger than the neomycin cassette of pNE0£GAL. The 3.2 
kb EcoRI fragment and the 8.5 kb junctional fragment were 
observed in each of the 10 cell lines analyzed after 
initial histochemical classification as pGKL or mosaic, 

15 as shown for two such lines in Fig. 3B. In contrast, 
e&ch of the four 0GAIH- colonies examined by Southern 
analyses showed that p0645 had integrated at a random 
site. 

These data show that FLP-mediated 

20 recombination will target the integration of trans fected 

DKA to a specific chromosomal site at frequencies that 

exceed those of random integration, and that the event 

can be marked by the alteration in gene activity at the 

target site. The efficiency of targeted integration can 

25 be increased by standard optimization techniques, such as. 

for example, by using ratios of the integrating plasmid 

and FLP expression vectors different from- the single 

ratio mixture used here, or by using FRT mutations in the 

plasmid and chromosomal sites to decrease the frequency 

30 with which successfully integrated plasmids are 

subsequently excised. 

• ' ' ' / 

While the invention has been described in 

detail with reference to certain preferred embodiments 

thereof, it will be understood that modifications and 

35 variations are within the spirit and scope of that which 

is described and claimed. 
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SUMMARY OF SEQUENCES 

Sequence I.D. No. 1 is the approximately 
1450 base-pair sequence encoding a FLP recombinase 
5 contemplated for use in the practice of the present 
invention, as veil as the amino acid sequence deduced 
therefrom. 

Sequence I.D* No. 2 is the amino acid 
sequence deduced from the nucleotide sequence of Sequence 
10 ID No. 1. 
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SEQUENCE LISTING 1 



(1) GENERAL INFORMATION: 

(1) APPLICANT: UAHL, DR f GEOFFREY H 

0' GORMAN DR, STEPHEN V 

(ii) TITLE OF v INVENTION: FLP -MEDIATED GENE MODIFICATION IN 
MAMMAT.TAN CELLS, AND COMPOSITIONS AND CELLS USEFUL 
THERE FOR 

(ill) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: FITCH, EVEN, TAB IN & FLANNERY 

(B) STRE ET: 135 South LaSalle Street, Suite 510 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60603 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1;0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) . FILING' DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT. INFORMATION: 

(A) NAME: REITER MR, STEPHEN E 

(B) REGIS TRATION NUMBER: 31192 

(C) REFERENCE/DOCKET NUMBER: 50730 

(ix) TELECOMMUNICATION INFORMATION : 
(A> TELEPHONE: (619) 552-1311 

(B) TELEFAX: (619) 552-0095 

(C) TELEX: 20 6566 PATLAW CGO 
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(2) INFORMATION FOR SEQ ID N0:1: 

(I) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH: 1380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: llnpnr 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: NATIVE FLP 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1269 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG CCA CAA TTT GAT ATA TTA TGT AAA ACA CCA CCT AAG GTG CTT GTT 48 
Met Pro Gin Phe Asp He Leu Cys Lys Thr Pro Pro Lys Val Leu Val 
1 5 10 15 



CGT GAG TTT GTG GAA AGG TTT GAA AGA CCT TCA GGT GAG AAA ATA GCA 96 
Arg Gin Phe Val Glu Arg Phe Glu Ag Pro Ser Gly Glu L^s He Ala 



TTA TGT GOT GCT GAA CTA ACC TAT TTA TGT TGG ATG ATT ACA CAT AAC 144 
Leu Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met He Thr His Asn 
35 — 40 45 

GGA ACA GCA ATC AAG AGA GCC ACA TTC ATG AGC TAT AAT ACT ATC ATA 192 
Gly Thr Ala lie Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He lie 
50 55 60 

AGC AAT TCG CTG ACT TTC GAT ATT GTC AAT AAA TCA CTC CAG TTT AAA 240 
Ser Asn Ser Leu Ser Phe Asp He Val Asn Lys Ser Leu Gin Phe Lys 
65 . 70 75 80 

TAG AAG AGG CAA AAA GCA ACA ATT CTG GAA GCC TCA TTA AAG AAA TTG 288 
Tyr Lys Thr Gin Lys Ala Thr lie Leu Glu Ala Ser Leu Lys Lys Leu 
85 90 95 

ATT CCT GCT TGG GAA TTT ACA ATT ATT CCT TAG TAT GGA CAA AAA CAT 336 
lie Pro Ala Trp Glu Phe Thr He He Pro Tyr Tyr Gly Gin Lys His 
100 105 HO 

CAA TCT GAT ATC ACT GAT AIT GTA ACT ACT TIG CAA TTA CAG TTC GAA 384 
Gin Ser Asp He Thr Asp lie Val Ser Ser Leu Gin Leu Gin Phe Glu 
115 120 125 

TCA TCG GAA GAA GCA GAT AAG GGA AAT AGC CAC ACT AAA AAA ATG CTT 432 
Ser Ser Glu Glu AT« Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu 
130 S5 140 

AAA GCA CTT CTA ACT GAG GGT GAA AGG ATC TGG GAG ATC ACT GAG AAA 480 
Lys Ala Leu Leu Ser Glu Gly Glu Ser He Trp Glu He Thr Glu Lys 
145 150 155 160 
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ATA CTA AAT TOG TTT GAG TAT ACT TCG AGA TTT ACA AAA ACA AAA ACT 
He Leu Asn Ser Phe Glu Tyx Thr Ser Are Phe Thr Lys Thr Lys Thr 

170 " 



165 



1*5 



TTA TAC CAA TTC CTC TTC CIA GCT ACT TTC ATC AAT TGT GGA AGA TTC 
Leu Tyr Gin Phe Leu Phe Leu Ala Thr Fhe He Asn Cys Gly Arz Phe 
180 185 7 190 * 

AGC GAT ATT AAG AAC GTT GAT CCG AAA TCA TTT AAA TTA GTC CAA AAT 
Ser Asp He Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn 
195 200 205 

AAG TAT CTG GGA GTA ATA ATC GAG TGT TTA GTG ACA GAG ACA AAG ACA 
Lys TVr Leu Gly Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr 
210 215 220 

AGC GTT ACT AGG GAG ATA TAC TTC TTT AGC GGA AGG GGT AGG ATC GAT 
Ser Val Ser Arg His He Tjrr Phe Phe Ser Ala Arg Gly Arg He Asp 
225 230 235 240 

CCA CTT GTA TAT TTG GAT GAA TTT TTG AGG AAT TCT GAA CCA GTC CTA 
Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu 
245 250 255 

AAA CGA GTA AAT AGG ACC GCC AAT TCT TCA AGC AAT AAA GAG GAA TAC 
Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tvr 
260 265 270 

CAA TTA TTA AAA GAT AAC TTA GTC AGA TCG TAC AAT AAA GCT TTG AAG 
Gin Leu leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys 
275 280 285 

AAA AAT GCC CCT TAT TCA ATC TTT GGT ATA AAA AAT GGC CCA AAA TCT 
Lys £S? Ma Ser He Phe Ala He Lys Asn Gly Pro Lys Ser 

290 295 300 

CAC ATT GGA AGA CAT TTG ATG ACC TCA TTT CTT TCA ATG AAG GGC CTA 
His He Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu 
310 315 J 320 



ACG GAG TTG ACT AAT GTT GTG GGA AAT TCG AGC GAT AAG CGT CCT TCT 
Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser 

325 330 • - 335 

GCC GTG GCC ACG ACA AOG TAT ACT CAT CAG ATA ACA GCA ATA CCT GAT 
Ala Val Ala Arg Thr Thr Tyr Thr His Gin lie Thr Ala He Pro Asp 
340 . 345 350 

CAC TAC TTC GCA CTA GTT TCT CGG TAC TAT GCA TAT GAT CCA ATA TCA 
Hts Tyr Phe Ala Leu Val Ser 

AAG GAA ATC ATA GCA TTG AAG GAT GAG ACT AAT CCA ATT GAG GAG TGG 
Lys Glu Met He Ala Leu Lys Asp Glu Thr Asn Pro lie Glu Glu Trp 
"0 375 380 

CAG CAT ATA GAA CAG CTA AAG GGT ACT GCT GAA GGA AGC ATA CGA TAC 
Gin His He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser He Arg Tyr 

385 390 "' ' '• >4 """ 



— ww» WU WXX wuA A1A 1UI 

Arg Tyr Tyr Ala Tyr As| Pro He Ser 



528 



576 



624 



672 



720. 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



395 



400 
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CCC GCA TGG AAT GGG ATA ATA TCA CAG GAG GTA CXA GAC TAC CTT TCA 1248 
Pro Ala Trp Asn Gly lie lie Ser Gin Glu Val Leu Asp Tyr Leu Ser 
F 405 410 415 

TCC TAC ATA AAT AGA CGC ATA TAAGTACGCA TTTAAGCATA AACACGCACT 1299 
Ser Tyr lie Asn Arg Arg lie 
420 

ATGCCGTTCT TCTCATGTAT A T ATATATAC AGGCAACACG CAGATATAGG TGCGACGTGA 1359 
ACAGTGAGCT GTAIGTGCGC A 1380 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGT H: 423 amino acids 
(BY TYPE: amino add 
(D) TOPOLOGY: linear 

(11) HOLECOLE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Pro Gin Phe Asp lie Leu Cys Lys Thr Pro Pro Lys Val Leu Val 
1 5 10 15 

Arg Gin Phe Val Glu Arg Phe Glu Axj Pro Ser Gly Glu Ljs lie Ala 

Leu Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met lie Thr His Asn 

-35 ' 40 45 

Gly Thr Ala lie Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr lie lie 
50 55 60 

Ser Asn Ser Leu Ser Phe Asp lie Val Asn Lys Ser Leu Gin Phe Lys 
65 70 75 80 

Tyr Lys Thr Gin Lys A 1fl Thr lie Leu Glu Ala Ser Leu Lys Lys Leu 
3 J B5 90 95 

lie Pro Ala Trp Glu Phe Bir lie lie Pro Tyr Tyr Gly Gin Lys His 
100 105 110 

Gin Ser Asp lie Thr Asp He Val Ser Ser Leu Gin Leu Gin Phe Glu 
115 120 125 

Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu 
130 135 140 

Lys Ala Leu Leu Ser Glu Gly Glu Ser He Trp Glu He Thr Glu Lys 
145 150 155 166 

lie Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr 
165 170 175 

Leu Tyr Gin Phe Leu Phe Leu Ala Thr Phe He Asn Cys Gly Arg Phe 
J 180 185 190 
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Ser Asp He lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn 
195 200 205 

Lys Tyr Leu Gly Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr 
210 215 220 

Ser Val Ser Arg His lie Tyr Phe Phe Ser Ala Arg Gly Arg lie 
225 230 235 2' 

Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu 
245 250 255 

Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr 
260 265 270 

Gin Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys 
275 280 285 

Lys Asn Ala Pro Tyr Ser lie Phe Ala He Lys Asn Gly Pro Lys Ser 
290 295 300 

His He Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu 
305 310 315 320 

Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser 
325 330 335 

Ala Val Ala Arg Thr Thr TYr Thr His Gin lie Thr Ala lie Pro Asp 
340 345 350 

His TVr Phe Ala Lou Val Ser Arg Tyr Tyr Ala Tyr Asp Pro lie Ser 
355 _ 360 365 

Lys Glu Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Trp 
370 375 380 

Gin His He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser He Arg 
385 390 395 4i 

Pro Ala Trp Asn Gly He lie Ser Gin Glu Val Leu Asp Tyr Leu Ser 
405 410 415 



Ser Tyr lie Asn Arg Arg He 
420 
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THAT WHICH IS CTATMKD IS: 

1. A mammalian recombination system 
comprising; 

5 (i) FLP recombinase, or a nucleotide sequence 

encoding same, and 

(ii) a first DNA comprising a nucleotide 
sequence containing at least one FLP 
recombination target site therein. 

10 

2. A recombination system according to Claim 1 
further comprising: 

(iii) a second DNA, herein said second 
DMA is selected from: 

15 (a) at least a second portion of said 

first gene of interest, or 
(b) at least a portion of a second gene 
of interest; 

wherein said second DNA contains at least one FLP 
20 recombination target site; and wherein said second DNA, 
when combined in reading frame with said first DNA, 
provides a functional gene. 

3* A recombination system according to Claim 2 
25 wherein said second DNA comprises an additional portion 
of said first gene of ixiterest. 

4. A recombination system according to Claim 2 
wherein said, second DNA comprises at least a portion of a 

30 second gene of interest. 

5. A recombination system according to Claim 4 
wherein said portion of said second gene of interest. 
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when combined in reading frame with said first DNA, 
provides a hybrid , functional gene. 

6. A recombination system according to claim 4 
5 wherein said portion of said second gene of interest, 
when combined with said first DNA, disrupts the function 
of said first gene of interest. 

l/ A recombination system according to Claim 1 
10 " wherein said first DNA further comprises a second FLP 
recombination target site. 

8. A recombination system according to Claim 1 
wherein the FLP recombinase is derived from a species of 

15 the genus Saccharomyces. 

9. A recombination system according to Claim 1 
wherein the FLP recombinase is derived from a strain of 
Saccharomyces cerevisiae. 



20 



25 



30 



10. A recombination system according to Claim 9 
wherein said FLP recombinase is encoded by the 
approximately 1450 base pair sequence set forth as 
Sequence ID No. 1. 

11. A recombination system according to Claim i 
wherein said first DNA provides a readily analyzable 
marker feature to the host system. 

12. A recombination system according to Claim 2 
wherein said second DNA provides a readily analyzable 
marker feature to the host system. 
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13. A DNA construct comprising, as an 
autonomous fragment : 

(a) at least one FLP recombination target 
site, 

5 (b) at least one restriction endonuclease 

recognition site, 

(c) at least one marker gene, 

(d) a bacterial origin of replication, and 
optionally 

10 (e> a mammalian cellular or Viral origin of 

DNA replication. 

14. A DMA construct comprising, as an 
autonomous fragment, in the following order, reading from 

15 5 1 to 3 1 along said fragment: 

(a) a first FLP recombination target site, 

(b) an insert portion comprising, in any 
suitable sequence: 

(1) at least one restriction 

20 endonuclease recognition site, 

(2) at least one marker gene, 

(3) a bacterial origin of replication, 
and optionally 

(4) a mammalian cellular of viral origin 
25 of DNA replication, and 

(c) a second FLP recombination target site in 
tandem with said first FLP recombination 
target site. 

30 is. A method for the assembly of functional 

gene (s), which is (are) then suitable for activation of 
expression in mammalian cells, by recombination of 
individually inactive gene segments derived from one or 
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more gene (s) of interest, wherein each of said segments 
contains at least one recombination target site, said . 
method comprising: 

contacting said individually inactive gene 
5 segments with a FLP recombinase, under 

conditions suitable for recombination to occur, 

thereby providing a DNA sequence which encodes 

a functional gene of interest. 

10 16. A method according to Claim 15 wherein the 

FLP recombinase is derived from a species of the genus 
Saccharomyces. 

17. A method according to Claim 15 wherein the 
15 FLP recombinase is derived from a strain of Saccharomyces 

cerevisiae. 

18. A method according to Claim 17 wherein said 
FLP recombinase is encoded by the approximately 1450 Base 

20 pair sequence set forth as Sequence ID Ho. 1. 

19. A method for the disruption of functional 
gene(s) of interest, rendering said gene(s) unable to be 
inactivated for expression in mammalian cells wherein 

25 said gene(s) of interest contain at least one FLP 
recombination target site, said method comprising 
contacting said gene(s) of interest with: 

(i) a DNA segment which contains at least one 
FLP recombination target site, and 
30 (ilj FLP recombinase; 

wherein said contacting is carried out under conditions 
suitable for recombination to, occur between said gene and 
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said DNA segment , thereby disrupting the geiie(s) of 
interest and rendering said gene (s) non-functional. 

20. A method according to Claim 19 wherein said 
5 DNA segment provides a readily analyzable marker feature 

to the host system. 

21. A method according to Claim 19 wherein the 
FLP recombinase is derived from a species of the genus 

10 saccharomyces. 

■) ' . 

22. A method according to Claim 19 wherein the 
PEP recombinase is derived from a strain of Saccharomyces 
cerevisiae. 



15 



20 



23. A method according to Claim 22 wherein said 
FLP recombinase is encoded by the approximately 1450 base 
pair sequence' set forth as Sequence ID No. 1. 



24. A method for the recovery of transfected 
DNA from the genome of a transfected organism, wherein 
the genomic DNA of said transfected organism contains a 
fragment having two tandemly oriented FLP recombination 
target sites therein, said method comprising contacting 

25 genomic DNA from said organism with FLP. 

25. A method for the precisely targeted 
integration of DNA into the genome of a host organism, 
said method comprising: 

30 (i) introducing a FLP recombination target 

site into the genome of cells which are 
compatible with the cells of the subject. 
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introducing a- first DNA comprising a 
nucleotide sequence containing at least 
one FLP recombination target site therein 
into the FLP recombination target site in 
the genome of said cells by contacting 
said cells with said first DMA and FLP 
recombinase, and thereafter 
introducing the cells produced by the 
process of step (il) into said subject. 

26. A method according to Claim 25, further 
comprising contacting the genomic DNA from said subject 
with FLP, thereby recovering the transf ected DNA 
containing said first gene of interest from the genome of 

15 said transfected organism. 

27. A method according to Claim 26, further 
comprising introducing at least a portion of a second 
gene of interest into said FLP recombination target site. 

20 . 

28. A method according to Claim 25, further 
comprising introducing at least a portion of a second 
gene of interest into one of the FLP recombination target 
sites of said subject. 

29. A mammalian cell, wherein the genomic DNA 
of said cell contains at least one FLP recombination 
target site therein. 

30 30. A mammalian cell according to Claim 29 

wherein said FLP recombination target site in the genomic 
DNA of said cell is positioned within at least a portion 
of one or more gene(s) of interest. 



(ii> 



5 

(iii) 
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31. A manna] tain cell according to Claim 30, 
further comprising DNA encoding, and capable of 
expressing, in mammalian cells, a FLP recombinase. 

5 32. A mammalian cell according to Claim 30 

wherein said gene(s) of interest provide a readily 
analyzable marker feature to the host system. 

33. A mammalian cell according to Claim 29 
10 wherein said FLP recombination target site has the 

sequence: 

5 » HSAAGTTarCATTCTCTAGAAAGTAT 
15 or functional equivalents thereof. 

34. A mammalian cell according- to Claim 30 
further comprising an additional DNA fragment, wherein 
said additional DNA fragment is selected from: 

20 (a) at least a second portion of said 

first gene of interest, or 

(b) at least a portion of a second gene 
of interest; 

wherein said second DNA contains at least one FLP 
25 recombination target site; and wherein said second DNA, 
when combined in reading frame with said first DNA, 
provides a functional gene. 



30 



35. A transgenic, non-human mammal, wherein 
said mammal contains at least one FLP recombination 
target site in the genomic DNA thereof. 
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36. A transgenic, non-human mammal according to 
Claim 35 wherein said FLP recombination target site is 

< 

positioned within at least a portion of one or more 
gene(s) of interest. 

5 

37. A transgenic, non-human mammal according to 
Claim 35 , further comprising a nucleotide sequence 
encoding, and capable of expressing, in transgenic, non- 
human mammals, 1 a FLP recombinase. 

10 

38. A transgenic, non-human mammal according to 
Claim 35, further comprising FLP recombinase. 

39. A transgenic, non-human mammal according to 
15 Claim 36 wherein said gene(s) of interest provide a 

readily analyzable marker feature to the host system. 

40. A transgenic, non-human mammal according to 
Claim 35 wherein said FLP recombination target site has 

20 the sequence: 

5 ' -GAAGTTCCTATTCTCTAGAAAGTATA66AACTT, 
or functional equivalents thereof. 

25 

41. A transgenic, non-human mammal according to 
Claim 36 further comprising an additional DNA fragment, 
wherein said additional DNA fragment is selected frtpn: 

(a) at least a second portion of said 
30 first gene of interest, or 

(b) at least a portion of a second gene 
of interest; 

wherein said second DNA contains at least one FLP 
recombination target site; and wherein said second DNA, 
35 when combined in reading frame with said first DNA, 
provides a functional gene. 
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42. A method for the site-specific integration 
- of transf ected DNA into the genome of a cell according to 
Claim 29, said method comprising: 

(i) contacting said genome with: 
5 (a) FLP recombinase, and 

(b) a first DNA comprising at least a 
portion of a first gene of interest; 

wherein said first DNA contains at 
least one FLP recombination target site; 
10 and thereafter 

(ii) maintaining the product of step (i) under 
conditions suitable for site-specific 
integration of said DNA sequence to occur 
at the FLP recombination target site in 

15 said genome of the host cells. 

43. ! A method according to Claim 42 wherein said 
FLP recombination target site in the genomic DNA of said 
cell is positioned within at least a portion of one or 

20 more gene (s). of interest. 

44. A method according to Claim 42 further 
comprising additionally contacting said host cell with a 
second DNA, wherein said second DNA is selected from: 

25 (a) at least a second portion of said 

first gene of interest, or 

(b) at least a portion of a second gene 
of interest? 

wherein said second DNA contains at least one FLP 
30 recombination target site; and wherein said second DNA, 
when combined in reading frame with said first DNA, 
provides a functional gene. 



45. A method according to Claim 42 wherein said 
35 FLP recombinase is provided by a FLP expression vector. 
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46. A method according to Claim 45 wherein the 
expression of FLP recombinase by said FLP expression 
vector is subject: to regulatory control. 

47. A method according to claim 42 wherein said 
FLP recombination target site is introduced into the 
genome of said host mammalian cell by trans feet ing said 
host cell with a DNA fragment containing at least one 
recombination target site therein. 



48. A method according to Claim 42 wherein the 
FLP recombination target site in the genomic DNA of said 
host mammalian cell is so positioned that the 
introduction of additional DNA sequences therein will 

15 inactivate the target gene. 

49. A method for the site-specific integration 
of transfected DNA into the genome of a host according to 
Claim 35, said method comprising: 

20 (i) contacting said genome with: 

(a) FLP recombinase f and 

(b) a first DNA comprising a nucleotide 
sequence containing at least one FLP 
recombination target site therein; 

25 and thereafter 

(ii) maintaining the product of step (i) under 
conditions suitable for site-specific 
integration of said DNA sequence to occur 
at the FLP recombination target site in 

30 . said genome of the host. 

50 . , A method according to Claim 49 wherein said 
FLP recombination target site is positioned within at . 
least a portion of one or more gene(s) of interest. 

35 
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51. A method according to Claim 49 further 
comprising additionally contacting said host with a 
second DNA, wherein said second DMA is selected from: 

(a) at least a second portion of said 
5 first gene of interest, or 

(b) at least a portion of a second gene 
of interest; 

wherein said second DNA contains at least one FLP 
recombination target site; and wherein said second DNA, 
10 when combined in reading frame with said first DNA, 
provides a functional gene. N 

52. A method according to Claim 49 wherein said 
FLP recombinase is provided by a FLP expression vector. 

15 

53. A method according to Claim 52 wherein the 
expression of FLP recombinase by said FLP expression 
vector is subject to regulatory control. 

20 54.' A method according to Claim 49 wherein "said 

FLP recombination target site is introduced into the 
genome of said host mammal by trans fee ting said host with 
a DNA fragment containing at least one recombination 
target site therein. 

25 

55. A method according to Claim 49 wherein the 
DNA of said host mammal contains at least one FLP 
recombination target site, and wherein said FLP 
recombination target site is so positioned that the 
30 introduction of additional DNA sequences therein will 
inactivate the target gene. 
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56. A method for the analysis of the 
development of a mammal, said. method comprising:. 

(a) providing a transgenic mammal comprising: 

(i) an expression construct encoding FLP 
5 under the control of a conditional 

promoter, and 

(ii) a reporter construct under the 
control of the same or a different 
promoter, wherein said reporter 

10 construct encodes a functional or 

non- functional reporter gene 
product, and wherein said construct 
contains at least one FLP 
recombination target site therein, 

15 wherein the. functional 

expression of the functional 
reporter gene is disrupted when said 
FLP recombination event occurs, or 
wherein the . functional 

20 expression of the non-functional 

reporter gene commences when said 
FLP recombination event occurs; and 

(b) following the development of said mammal to 
determine when expression of functional reporter gene 

25 product either commences or is disrupted. 

57. A method according to Claim 56 wherein said 
conditional promoter is developmentally-regulated. 

30 58. A cd-transfection assay for the occurrence 

> of FLP-mediated recombination, said assay comprising: 
(a) co-transf ecting a host mammalian cell 

with: 

(i) a FLP expression plasmid, and 
35 (ii) a reporter plasmid comprising a non- 

functional reporter gene wherein said non- 
functional reporter gene is inactivated by 
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the presence of extraneous DNA containing 
at least one recombination target site; < 
and 

(b) monitoring said host cell under a variety 
5 of conditions for the gain of expression of functional 
reporter gene product. 

59. A co-transfection assay for the occurrence 
of FLP-mediated recombination, said assay comprising: 
10 (a) co-transf ecting a host mammalian cell 

with: 

(i) a FLP expression plasmid, and 

(ii) a reporter plasmid comprising a 
functional reporter gene containing at 

15 least one recombination target site 

therein; and 
(b) monitoring said host cell under a variety 
of conditions for the loss of expression of functional 
reporter gene product. 



20 
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